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Abstract 

Introductory statistics needs innovative, evidence-based teaching practices that support and engage diverse students. 

To evaluate the success of a multidisciplinary, project-based course, we compared experiences of under-represented 
(URM) and non-underrepresented students in 4 years of the course. While URM students considered the material 
more difficult than non-URM students, URM students demonstrated similar levels of increased confidence in applied 
skills and interest in follow up courses as non-URM students. URM students were found to be twice as likely as non- 
URM students to report that their interest in conducting research increased. Increasing student confidence and interest 
gives all students a welcoming place at the table that will afford the best hope for achieving the kind of statistical 
literacy necessary for interdisciplinary research. 
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Introductory statistics needs innovative, evidence-based teaching practices that support and engage diverse students.To 
evaluate the success of a multidisciplinary, project-based course, we compared experiences of under-represented (URM) 
and non-underrepresented students in 4 years of the course.While URM students considered the material more difficult 
than non-URM students, URM students demonstrated similar levels of increased confidence in applied skills and interest 
in follow up courses as non-URM students. URM students were found to be twice as likely as non-URM students to re¬ 
port that their interest in conducting research increased. Increasing student confidence and interest gives all students a 
welcoming place at the table that will afford the best hope for achieving the kind of statistical literacy necessary for inter¬ 
disciplinary research. 


INTRODUCTION 

A central challenge of introductory statistics is the development of 
a curriculum that not only serves diverse students, but also sparks 
communication, reasoning and collaboration that clearly crosses 
traditional disciplinary boundaries. Previous authors have suggest¬ 
ed that this can best be achieved through inquiry-based projects 
(Bailey, Spence, & Sinn 2013) that allow students to “decompose 
their topic, identify key components; abstract and formulate dif¬ 
ferent strategies for addressing it; connect the original question 
to the statistical framework; choose and apply methods; reconcile 
the limitations of the solution; and communicate findings” (Nolan 
& Temple Lang, 2009). This type of project-based learning is most 
commonly defined as an instructional approach based on authentic, 
real-world activities that are aimed at engaging student interest and 
enthusiasm (BIE, 2012; Krajcik & Blumenfeld, 2006). Designed to an¬ 
swer a question or solve a problem, this approach allows students 
to face challenges that lead to answers, reflect on ideas and make 
decisions that affect project outcomes (Aditomo, Goodyear, Bliuc, 
& Ellis, 2013). There is an emerging literature showing that proj¬ 
ect-based learning in many contexts is more effective in promoting 
deep thinking, the ability to apply knowledge, communication and 
reasoning skills, when compared to traditional didactic approaches 
(e.g. Harada &Yoshina, 2004; Hickey,Wolfe, & Kindfield, 2000; Hick¬ 
ey, Kindfield, Horwitz, & Christie, 1999; Hmelo-Silver, Duncan, & 
Chinn, 2007; Langer, 2001; Lynch, Kuiper, Pyke, & Szesze, 2005;Walk- 
er & Leary, 2009; however, see Kirschner, Sweller, & Clark, 2006). 

Though accumulating research has shown that project-based 
activities, especially those providing a research experience (Rus¬ 
sell, Hancock, & McCullough, 2007), promote positive learning out¬ 
comes and may hold promise for better engaging students with 
varying levels of preparation (Hatfull et al., 2006; Jones, Rasmussen, 
& Moffitt, 1997; Mergendoller, Maxwell, & Bellisimo, 2006), little re¬ 
search has focused on group differences that may predict greater or 
lesser success for project-based learners. Consideration of group 
differences is particularly important in courses within STEM fields 
where retention of underrepresented minority students (URM) has 
been weak with URM students being less likely to persist in STEM 
fields at all levels (e.g., HERI, 2010; Schultz et al., 2011; daSilva and 


Pinto, 2014; Bilgin, et al, 2015). In fact, underrepresented students 
often exit science and mathematics following introductory cours¬ 
es (National Science Foundation - Division of Science Resources 
Statistics, 2004) and of the numerous reasons indicated for this 
early departure, the most often cited reason is uninspiring or un¬ 
suitable pedagogical practices (Graham, Frederick, Byars-Winston, 
Hunter, & Handelsman, 2013; Seymour & Hewitt, 1997). Students’ 
perceptions of teaching approaches and having agency in their own 
learning predict positive academic outcomes as well as satisfaction 
and skill development (Lizzio, Wilson, & Simons, 2002).Thus, more 
engaging teaching practices are needed, followed by demonstrat¬ 
ed success in providing supportive experiences for students from 
diverse backgrounds that increase confidence in applied skills and 
foster positive attitudes toward future learning. 

In previous publications, we described the development of a 
multidisciplinary, project-based introductory statistics course (Di- 
erker, et al., 2012) aimed at engaging students in applied statistical 
projects across both divisional and departmental boundaries, as 
well as its success in attracting higher rates of URM students than 
a traditional introductory statistics curriculum offered through a 
math department (Dierker, et al., 2015). Funded by the Nation¬ 
al Science Foundation and first introduced into the curriculum at 
a selective liberal arts college, the project-based course engages 
students with real data and code-based statistical software. Close¬ 
ly following the recommendations outlined in the Guidelines for 
Assessment and Instruction in Statistics Education (GAISE) report 
(Aliaga et al., 2005), the course is designed around student research 
projects of their own choosing and offers individualized hands-on 
experience in applied statistics. Projects are presented at the end 
of the semester at a research poster session in which students have 
the opportunity to describe their process of inquiry, including the 
different decisions made along the way, their premises, conclusions 
and any barriers that they faced. 

Rather than focusing on rules associated with traditional lists 
of statistical tools (e.g., z-test, one sample t-test, two sample t-test, 
paired t-test, etc.), we have organized the course according to the 
decisions and skills involved in statistical inquiry. Basic themes such 
as measurement and descriptive and graphical representation are 
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covered, as well as more specific inferential methods needed to 
test hypotheses and/or explore the empirical structure of data 
(ASA, 2014). All, however, are introduced as the student’s research 
questions dictate their presentation. In this way, students are pro¬ 
vided with opportunities to learn to evaluate what tools would be 
most appropriate for their research question(s) and to engage in 
decision making. While not all students utilize the entire menu of 
tools offered in this course, through on-line materials, in and out of 
class support and collaboration with peers, they are exposed to a 
wide variety of methods, and learn to choose and use them flexibly 
as they are needed.This approach is aimed at building student con¬ 
fidence in their ability to evaluate data and seek out appropriate 
methods for the questions at hand. 

Importantly, the project-based course is aimed at taking ad¬ 
vantage of students’ natural curiosity and providing a common lan¬ 
guage for approaching questions across numerous disciplines.This 
is achieved by asking students in the first week of class to develop 
their own research question from a number of large data sets rep¬ 
resenting different disciplines such as ecology, psychology, econom¬ 
ics, planetary science and more.Thinking with data to answer a re¬ 
search question is an important component of the course as is the 
connection to the applied content (ASA, 2014). In addition, great 
care is taken to present translations of terminology and vocabulary 
that are used across different disciplines for similar statistical con¬ 
cepts (e.g. independent and dependent variables vs. predictors and 
outcomes vs. stimulus and response variables). 

Based on survey and administrative data from students en¬ 
rolling in the project-based course across 4 academic years, the 
present paper compares background characteristics, learning ex¬ 
periences and course outcomes for URM and non-URM students. 
The unique nature of the multi-disciplinary project-based course 
and its potential opportunity to contribute to increased retention 
in STEM makes consideration of the background, experiences, and 
outcomes of URM and non-URM students particularly important. 
Course experiences included measures such as the usefulness of 
resources and difficulty ratings, while measures of course outcomes 
focused on increases in confidence, knowledge level, and students’ 
interest levels in conducting research, using statistics, and taking 
follow-up courses.We hypothesize that despite potential differenc¬ 
es in background characteristics of these two groups, experiences 
with the course and course outcomes will be similarly positive. 
We used multivariate regression analyses to address whether dif¬ 
ferences related to URM status were present after controlling for 
students’ other background characteristics and course experiences 
in order to have a fuller understanding of how the course engaged 
URM and non-URM students. 

METHODS 

Participants 

Data were drawn from administrative records and student surveys 
completed before and after the multidisciplinary, project-based 
course (N = 333) between fall semester 2010 and fall semester 
201 3. The pre course survey was completed prior to the end of 
the first week of classes and the post course survey during the last 
week of the semester. Each survey took approximately 10-15 min¬ 
utes to complete. The sample included 74 (22.2%) students who 
self-reported as under-represented (i.e. African American and/or 
Hispanic). The remaining 259 students (77.8%) self-reported as 


White, Asian or other and were designated for the purposes of 
the present analyses as non-underrepresented. A total of 202 stu¬ 
dents (61.0%) were female, and 140 (42.0%) were in their first or 
second year of college. 

The multidisciplinary, project-based introductory statistics 
course was offered through the Quantitative Analysis Center, a col¬ 
laborative effort of academic and administrative departments that 
supports quantitative analysis across the curriculum and provides 
an institutional framework for collaboration across departments 
and disciplines in the area of statistics and data analysis.Titled Ap¬ 
plied Data Analysis, the course was described in the university’s 
on-line catalogue as a “project-based course, [in which] you will 
have the opportunity to answer questions that you feel passion¬ 
ately about through independent research based on existing data. 
Students will have the opportunity to develop skills in generating 
testable hypotheses, conducting a literature review, preparing data 
for analysis, conducting descriptive and inferential statistical anal¬ 
yses, and presenting research findings. The course offers unlimit¬ 
ed one-on-one support, ample opportunities to work with other 
students, and training in the skills required to complete a project 
of your own design.These skills will prepare you to work in many 
different research labs across the University that collect empirical 
data. It is also an opportunity to fulfill an important requirement 
in several different majors.” The course was open to all students 
and there were no prerequisites for enrollment. It represented one 
option to fulfill a major requirement for biology, earth and environ¬ 
mental science, government, neuroscience and behavior, sociology, 
and psychology or could be applied to the natural sciences and 
mathematics general education recommendation.The course was 
not specifically required of any student and did not represent the 
only option for fulfilling requirements for any major within the uni¬ 
versity. The average enrollment per class section was 17.8 (s.d. = 
4.8, range = 6 - 26). 

This project-based course differs from standard introductory 
statistics courses in several ways.Where traditional statistics cours¬ 
es often employ a building-block approach which covers statistical 
tools in a serial manner and in the absence of a context in which to 
apply them,this course entails learning statistics as students answer 
their own questions and choosing among statistical tools present¬ 
ed in a parallel manner.The course works to promote learning by 
creating a context in which students familiarize themselves with 
several possible data sets, formulate a statistical research question, 
choose among several available statistical tools, apply an appropri¬ 
ate method, and communicate their findings. Based on the student’s 
choice of data, each generates testable hypotheses for their cho¬ 
sen dataset; conducts a literature review on their topic of interest; 
works to refine or broaden their research questions based on in¬ 
formation they collect; prepares data for analysis (i.e., data manage¬ 
ment); selects and conducts descriptive and inferential statistical 
analyses; and evaluates, interprets and presents research findings. 
These activities are not presented or experienced as distinct stages 
but rather, as a series of ongoing, interactive tasks. 

Learning materials and teaching strategies were designed to be 
structured enough to allow students to consistently move forward 
with their research projects, yet broad enough to encourage them 
to creatively and independently explore their questions, letting stu¬ 
dents actively drive the decisions involved in inquiry. In this way, the 
support each student receives is dictated by their own research 
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question and the results at each stage of their project. 

The semester-long course met 3 to 4 times a week for a total 
of 4 hours. On-line lecture clips were used to support a flipped 
classroom approach (e.g., Abeysekera & Dawson, 2014) and were 
designed to provide students with adequate substantive and prac¬ 
tical background for engaging in workshop-oriented class sessions 
and completing exams and cumulative project-based assignments 
toward the completion of their research project. During class ses¬ 
sions instructors and peer mentors supported students in mak¬ 
ing progress on their research project. Additional individualized 
support from statistical tutors was available during drop-in hours 
throughout the semester. 

Measures 

Measures included gender and the following variables: 

Background Characteristics. Class year was dichotomized 
into 1st and 2nd year vs. upper classmen. Students’ high school 
backgrounds were collapsed based on whether they attended a 
public vs. non-public high school.The non-public category included 
private schools, religious schools, and home-schools. Financial aid 
status was characterized as students with demonstrated need re¬ 
ceiving grants and/or self-help financial aid vs. those enrolled in the 
university without financial assistance. 

Students were asked to indicate on the pre course survey 
whether they had taken an Advanced Placement (AP) statistics 
course during high school and perceived skills in mathematics were 
measured by the questions. “How well did you do in mathematics 
courses you have taken in the past?” and “How good at mathemat¬ 
ics are you?” both rated on a scale from I (very poor) to 7 (very 
well/good). Math SAT scores were drawn from administrative data. 

Prior experience with programming and/or code-based sta¬ 
tistical software was evaluated in the pre course survey. Individual 
programs endorsed by 4 or more students include Java, Python, C/ 
C++, Matlab, R, SAS, Stata, and SPSS. 

Students were asked “How likely is it that you would have 
taken any course in statistics if such a course were not required”. 
Response options ranged from I = not at all likely to 7 = very likely. 

Experiences with the course. Based on the post course 
survey, students rated the usefulness of the flipped classroom ses¬ 
sions in completing both the research project and course exams on 
a scale from I = not at all useful to 5 = extremely useful. Usefulness 
of course resources including readings, video lectures and model 
statistical code was also rated on a scale from I = not at all useful 
to 5 = extremely useful and averaged across individual respons¬ 
es related to both usefulness in terms of completing the research 
project and the exams. 

Instructors, peer mentors, and tutors were also available out¬ 
side of class sessions to provide additional one-on-one support. 
Students were asked to report the number of times they met with 
each and, if they used any support, were asked to rate how helpful 
this support was to them.Variables were dichotomized into wheth¬ 
er or not a student visited each (i.e., instructors, peer mentors and 
tutors) and the helpfulness of each was rated on a scale from I = 
not at all helpful to 9 = very helpful. 

Students’ perceptions of course rigor were measured with the 
questions “How difficult for you was the material covered in this 
course?” (from I = very easy to 7 = very difficult),“Was this course 
more challenging, less challenging or similarly challenging compared 


to other college courses you have taken?” and “How challenging 
was the research project?” from I = not at all challenging to 5 =the 
most challenging project the student had ever completed. 

Students’ overall impressions of the course were measured 
with the questions “Did you accomplish more than you expected, 
less than you expected or about the same as you expected?” and 
“Did you find this course more useful, less useful or similarly useful 
compared to other college courses you have taken?” 

Course outcomes. Increases in perceived confidence in sev¬ 
eral data analysis and statistical skills were evaluated based on stu¬ 
dent ratings from the pre to post course survey.These skills were 
evaluated with the questions how confident are you in examining 
codebooks, developing research questions, managing data, conduct¬ 
ing research, conducting statistical analyses, graphing, interpreting 
results, effectively presenting statistical results, and understanding 
the discipline of statistics, each rated on a scale from I = not at all 
confident to 4 = very confident. For students not rating themselves 
at the maximum confidence level in the pre-survey (leaving 72.2% - 
98.6% of the sample) individual dichotomous variables were creat¬ 
ed for each skill indicating whether or not the student’s confidence 
increased between the pre and post course surveys. 

Knowledge of data analysis and statistical concepts was evalu¬ 
ated through 3 multiple-choice, in-class exams. 

Students’ interest in conducting research and using statistics 
was measured in both the pre survey and the post survey by two 
questions. “Are you interested in conducting research?” (from I 
= not at all interested to 4 = very interested) and “In the field in 
which you hope to be employed when you finish school, how much 
will you use statistics?” (from I = not at all to 7 = great deal). For 
students not rating themselves at the maximum interest level in the 
pre survey (64.7% of the sample on research interest and 90.2% of 
the sample on statistics use in employment), dichotomous variables 
were created for each question indicating whether or not the stu¬ 
dents’ interest increased between the pre and post surveys. 

Finally, students were asked if they would like to take one or 
more courses as a follow-up to the project-based statistics course. 
Options included a course in programming, data set construction, 
data visualization, science writing, advanced statistics or other. Indi¬ 
vidual courses were examined separately and an aggregate variable 
was constructed indicating whether or not a student endorsed an 
interest in taking at least one follow-up course. 

Analyses 

Chi-square Tests of Independence and ANOVA were used to ex¬ 
amine the association between URM status and categorical and 
continuous variables measuring student’s background characteris¬ 
tics, experience with the course and course outcomes. Multivari¬ 
ate linear or logistic regression models, for continuous and binary 
outcome variables respectively, were then run to evaluate whether 
effects of URM status persisted when controlling for other signifi¬ 
cant background characteristics and course experiences. 

RESULTS 

Background Characteristics 

Sixty-one percent of students enrolling in the course were female, 
56.8% had attended a public high school and 22.5% reported having 
taken AP statistics during high school. Forty-two percent of the 
sample enrolled in the project-based course during their first or 
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second year of college (i.e., freshman or sophomore class status), 
and half were receiving financial aid. The average Math SAT score 
was 689 (s.d. 70.4, range 500 to 800) and the mean rating on the 
questions “How well did you do in mathematics courses you have 
taken in the past?” and “How good at mathematics are you?” was 
4.6 (s.d. 1.44) and 4.8 (s.d. 1.23), respectively, both rated on a scale 
from I =very poor to 7 =very well/good.The mean rating on the 
question “How likely is it that you would have taken any course in 
statistics if such a course were not required” was 4.4 (s.d. 1.77) 
rated on a scale from I = not at all likely to 7 = very likely. 3 I. I % of 
students reported at least some prior experience with a program¬ 
ming language or code-based statistical software. 

A comparison of these background characteristics for un¬ 
der-represented (URM) and non-underrepresented students en¬ 
rolling in the multidisciplinary, project-based course is presented 
in Table I. URM and non-URM students were similarly likely to be 
female, have attended a public high school, and to have taken an AP 
statistics course. URM and non-URM students were also similarly 
likely to enroll in the multidisciplinary project-based course during 
their first two years of college and to report having had some 
experience with at least one programming language or code-based 
statistical software. However, URM students were more than twice 
as likely as non-URM students to be receiving financial aid. Further, 
URM students enrolling in the project-based course had significant¬ 
ly lower math SAT scores than non-URM students and rated them¬ 
selves as less good at math (Table I). URM students also reported 
that they would have been less likely than non-URM students to 
take a course in statistics if it were not required. 


Student Experiences with the Course 

Based on post survey responses, a comparison of self-reported ex¬ 
periences with the course by underrepresented student status is 
presented in Table 2. URM and non-URM students reported that 
the flipped classroom sessions (sample M = 3.6, s.d. = 0.97) and 
supporting course resources (sample M = 3.3, s.d. = 0.79) were 
similarly useful on a scale from I = not at all useful to 5 = extremely 
useful. Further, both groups were similarly likely to take advantage 
of out-of class support by visiting instructors (sample rate 63.8%), 
peer mentors (sample rate 60.3%) and statistics tutors (sample 
rate 51.2%) at least once and rated the support that they received 
in these sessions as similarly useful.While URM and non-URM stu¬ 
dents rated the research project as similarly challenging (sample M 
= 2.7, s.d. = 0.78 on a scale from I = not at all challenging to 5 = the 
most challenging project the student had ever completed), URM 
students described the material covered in the course as more 
difficult than non-URM students and were also more likely to rate 
the overall course as more challenging than other college courses 
they had taken (Table 2). 

While URM and non-URM students were just as likely to rate 
the course as more useful than others they had taken in college 
(overall 50.8%), URM students were significantly more likely to feel 
that they had accomplished more relative to what they had expect¬ 
ed (Table 2). 

Course Outcomes: Knowledge, Confidence, 
Attitudes, and Future Plans Following the Course 

A comparison of course outcomes by underrepresented student 


TABLE 1 . Background characteristics of students enrolled in 
student (URM) status. 

the multidisciplinary project-based course 

by underrepresented 


URM 1 
n = 74 

non-URM 
n = 259 

Statistics 


n (%) 2 

n(%) 


Gender (% female) 

48 (67.4%) 

148 (58.5%) 

ns 

1 st or 2nd year student 

36 (48.6%) 

104 (40.2%) 

ns 

Public High School 

38 (51.4%) 

151 (58.3%) 

ns 

Financial Aid 

62 (83.8%) 

104 (40.2%) 

X 2 (l) = 42.1, p < .001 

AP Statistics 

13 (17.8%) 

61 (23.8%) 

ns 

Programming Experience 

24 (32.4%) 

80 (30.9%) 

ns 

Java 

8(1 1.1%) 

26 (10.2%) 


Python 

2 (2.8%) 

17 (6.7%) 


C/C++ 

0 

6 (2.3%) 


Matlab 

1 (1.4%) 

4 (1.6%) 


R 

0 

4 (2.1%) 


SAS 

4 (5.6%) 

1 1 (4.3%) 


Stata 

5 (6.9%) 

16 (6.3%) 


SPSS 

7 (9.7%) 

19 (7.4%) 



Mean (s.d.) 

Mean (s.d.) 


Math SAT 

621 (77.2) 

709 (53.6) 

F( 1,274) = 107, p < .001 

How good at mathematics are you? (1 = very poor to 7 = very good) 

4.5 (1.12) 

4.9 (1.25) 

F( 1,323) = 4.04, p < .05 

How likely you would take a course in statistics if not required (1 = not 
at all likely to 7 = very likely) 

3.9 (1.88) 

4.6 (1.71) 

F( 1,325) = 7.96, p < .01 


'Under-represented students includes those self-identifying as Black or Hispanic. 
Percentages are based on the number of respondents completing each item. 
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TABLE 2. Self-reported student experiences with the multidisciplinary project-based 
status. 

course by under-represented student (URM) 


URM 1 
n = 74 

non-URM 
n = 259 

Statistics 

Usefulness of flipped class sessions M (s.d.) 

(1 = not at all useful to 5 = extremely useful) 

3.7 (0.93) 

3.5 (0.99) 

ns 

Usefulness of course resources 2 M (s.d.) 

(1 = not at all useful to 5 = extremely useful) 

3.3 (0.71) 

3.3 (0.81) 

ns 

Out of class support 

Visited instructor n (%) 3 

50 (67.6%) 

160 (62.7%) 

ns 

Helpfulness of support M (s.d.) 

(1 = not at all helpful to 9 = very helpful) 

8.1 (1.49) 

7.8 (1.75) 

ns 

Visited peer mentor n (%) 

40 (54.8%) 

126 (50.2%) 

ns 

Helpfulness of support M (s.d.) 

(1 = not at all helpful to 9 = very helpful) 

7.6 (2.01) 

7.2 (1.67) 

ns 

Visited statistics tutor n (%) 

50 (69.4%) 

146 (57.7%) 

ns 

Helpfulness of support M (s.d.) 

(1 = not at all helpful to 9 = very helpful) 

7.5 (2.13) 

6.9 (2.25) 

ns 

Rigor 

How difficult for you was the material covered in this course? M (s.d.) 

(1 = not at all difficult to 7 = very difficult) 

4.3 (1.25) 

3.9 (1.36) 

F( 1,329) = 6.01,p = .015 

More challenging compared to others college courses n (%) 

23 (31.9%) 

28 (1 1.0%) 

X 2 (l) = 17.2, p < .001 

How challenging was the research project? M (s.d.) 

(1 = not at all to 5 = the most challenging) 

2.8 (0.74) 

2.7(0.78) 

ns 

Overall 

n (%) 

n(%) 


Accomplished more relative to what you expected 

38 (53.5%) 

86 (33.7%) 

X 2 (l) = 8.4, p < .004 

More useful than other college courses 

32 (44.4%) 

134 (52.5%) 

ns 


'Under-represented students includes those self-identifying as Black or Hispanic. 
2 Averaged across individual items of readings, lectures, and sample statistical code. 
Percentages are based on the number of respondents completing the particular item. 


status are presented in Table 3. Based on responses in both the pre 
and post course surveys, URM and non-URM students reported in¬ 
creased confidence following the course at similar rates for each of the 
skills covered (i.e. overall increases for examining codebooks 81.2%, 
developing research questions 68.0%, managing data 60.9%, conduct¬ 
ing research 46.9%, conducting statistical analyses 65.8%, graphing 
53.3%, interpreting results 51.1%, effectively presenting results 48.8%, 
and understanding the discipline of statistics 59.2%).While both URM 
and non-URM students performed well on exams measuring knowl¬ 
edge of statistical concepts, URM students received significantly lower 
scores than non-URM students on each of the three multiple choice 
exams administered during the course (Table 3). 

Notably, URM students had reported lower levels of interest in 
conducting research than non-URM students in responses in the pre 
course survey (M = 2.9, s.d. = 0.78 vs. M = 3.1, s.d. = 0.81), F( 1,323) 
= 4.21, p = .04), and were significantly more likely to report that their 
interest in conducting research had increased following the course 
(Table 3). Approximately 40% of both URM and non-URM students 
showed increases in how much they believe that they will use statis¬ 
tics in the field in which they hope to be employed. 

Finally, more than 80% of students enrolling in the project-based 
course reported being interested in one or more follow-up courses 
including computer programming, constructing data sets, data visu¬ 
alization, science writing and advanced statistics. Though URM and 
non-URM students endorsed the majority of these courses at simi¬ 
lar rates, URM students were significantly less likely than non-URM 


students to be interested in taking a computer programming course 
following the project-based course (Table 3). 

Multivariate Analyses Examining Predictors of 
Course Outcomes 

Multivariate regression models were built by first entering back¬ 
ground characteristics based on the pre course survey and admin¬ 
istrative data into each multivariate model. Only those variables that 
reached statistical significance were retained in subsequent models. 
Next, items from the post survey assessing students’ experience with 
the course were simultaneously entered into models for each out¬ 
come on which there were significant differences related to URM 
status. 

While the bivariate analyses demonstrated differences based on 
URM status for several background characteristics, learning experi¬ 
ences, and course outcomes, URM status did not significantly predict 
any of the course outcomes with the exception of increase in inter¬ 
est in conducting research when controlling for the other predictors. 
That is, after controlling for pre course interest in research, other 
significant background characteristics and all measures of experiences 
with the course, URM students were found to be 2.2 times (Cl l.l- 
4.6) more likely to report that their interest in conducting research 
had increased following the course. In addition, higher ratings on the 
usefulness of course resources (OR=l.6 Cl 1.01-2.56), lower ratings 
of the challenge involved in the research project (OR=0.6 Cl 0.38- 
0.97) and reporting that the course was more useful than others tak- 
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TABLE 3. Achievement, confidence, attitudes and future plans 
represented student (URM) status. 

following the multidisciplinary project-based 

course by under- 


URM 1 

non-URM 



n = 74 

n = 259 

Statistics 

Increased confidence pre to post 

n (%) 2 

n (%) 2 


Examining data codebooks 

56 (77.8%) 

203 (82.2%) 

ns 

Developing a research question 

41 (67.2%) 

146 (68.2%) 

ns 

Managing data 

43 (62.3%) 

147 (60.5%) 

ns 

Conducting research 

28 (44.4%) 

101 (47.6%) 

ns 

Conducting statistical analyses 

44 (62.0%) 

164 (66.9%) 

ns 

Graphing 

33 (48.5%) 

130 (54.6%) 

ns 

Interpreting results 

31 (44.9%) 

128 (52.9%) 

ns 

Effectively presenting results 

25 (54.3%) 

73 (47.1%) 

ns 

Understanding the discipline of statistics 

41 (57.5%) 

142 (59.7%) 

ns 

Exam 1 M correct (s.d.) 

86.9% 3 (7.8) 

91.5% (5.9) 

F( 1,304) = 28.5, p < .01 

Exam 2 M correct (s.d.) 

83.7% (12.6) 

90.3% (7.7) 

F( 1,301) = 28.5, p < .01 

Exam 3 M correct (s.d.) 

85.6% (9.0) 

90.0% (6.7) 

F( 1,299) = 18.8, p < .01 

Attitudes pre to post 

" (%) 2 

n (%) 2 


Increased interest in conducting research 

22 (42.3%) 

46(29.1%) 

X 2 (l) = 5.17,p = .02 

Increased expectation of using statistics in future employment 

29 (43.9%) 

87 (38.5%) 

ns 


n (%) 

n (%) 


Interest in taking a follow-up course 

56 (76.7%) 

216 (83.7%) 

ns 

Programming 

18 (24.7%) 

107 (41.5%) 

ns 

Data set construction 

21 (28.8%) 

78 (30.2%) 

ns 

Data visualization 

22 (30.1%) 

90 (34.9%) 

ns 

Science Writing 

13 (17.8%) 

57 (22.1%) 

ns 

Advanced Statistics 

24 (32.9%) 

1 13 (43.8%) 

ns 

Other 

8 (10.8%) 

16 (6.2%) 

ns 


'Under-represented students includes those self-identifying as Black or Hispanic. 

2 n and % based on number of students who were able to increase their confidence rating between the pre and posttest (i.e. those not rating them¬ 
selves at the maximum confidence level in the pretest. 

Percentages are based on the number of respondents completing each item. Bonferoni adjusted p values are presented to correct for multiple tests 
when there are sets of items (increased confidence, exam scores, follow-up courses). 


en in college (OR=2.6 Cl 1.32-5.30) also independently predicted 
the likelihood of increased student interest in conducting research. 

When examining the association between students’ back¬ 
ground characteristics and average exam scores (the other course 
outcome on which URM and non-URM students had differed in 
the bivariate analyses) non-URM student status, higher Math SAT 
scores, and junior or senior class status each independently pre¬ 
dicted a higher average score on multiple choice exams. However, 
when entering measures of student experiences with the course 
into the model, URM status was no longer associated with perfor¬ 
mance on exams, though Math SAT (Beta .0004, p<.0001) and class 
status remained significant (Beta 0.02, p<.04). In addition, higher 
ratings on the usefulness of flipped classroom sessions (Beta 0.01, 
p<.009), lower ratings on having accomplished more than expected 
(Beta -0.02, p<.03) and reports that the course was more useful 
than others taken in college (Beta 0.02, p<.004) were also found to 
independently predict higher exam scores. 

DISCUSSION 

We have described in previous reports the development of this 
project-based, introductory statistics course and its success in at¬ 


tracting higher rates of URM students compared to a traditional sta¬ 
tistics curriculum offered by a math department (Dierker et al., 2012; 
Dierker et al., 2015). Based on survey and administrative data from 
students enrolling in the project-based course across 4 years, the 
present study compared URM and non-URM students on background 
characteristics, learning experiences and course outcomes to evalu¬ 
ate the success of the course in increasing confidence in applied skills 
and fostering positive attitudes toward future learning for students 
from diverse backgrounds. 

Results indicated that URM students entered the course with 
lower levels of confidence in their math skills, lower Math SAT scores 
and a lower likelihood of taking a statistics course if one were not 
required than non-URM students. While URM students also rated 
the course as more difficult than non-URM students, the URM stu¬ 
dents were more likely to report accomplishing more than expected. 
URM students scored somewhat lower on multiple choice exams 
evaluating content knowledge outcomes; however this was no lon¬ 
ger significant after controlling for other background characteristics 
and course experiences. Both URM and non-URM students showed 
similarly high rates of increased confidence and interest in pursu¬ 
ing follow-up courses. Furthermore, after controlling for pre course 
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interest in research, student background characteristics and experi¬ 
ences with the course, URM students were found to be over 2 times 
more likely than non-URM students to report that their interest in 
conducting research had increased. Overall, half of the students in the 
sample reported the course to be more useful than other college 
courses they had taken and more than three-quarters of the sample 
reported interest in taking at least one follow-up course, with no 
differences based on URM status. 

Despite URM students rating themselves more poorly in terms 
of math skills and scoring somewhat lower than non-URM students 
on the math section of the SAT, it should be noted that because of the 
selective nature of the university in which this project-based course 
was piloted, SAT scores for both groups are quite high, with both 
mean and median scores above 600. In other words, scores for the 
URM students in this study were not low, but instead, relatively lower 
than those of non-URM students enrolling in the course. Notably, pre¬ 
vious research has shown that African American and Hispanic college 
students with high grade point averages and SAT scores above 600 
typically do not pursue STEM college majors for reasons including 
poor teaching in STEM courses, lack of encouragement from teachers 
or parents, and self-perception of their own inability to be successful 
in STEM majors (George, Neale,Van Horne, & Malcolm 200l;Grandy, 
1998). Thus, this is a particularly important group to target with an 
innovative, welcoming curriculum that sparks interest in continued 
study. Given that more than three-quarters of URM students in the 
present study reported being interested in one or more follow-up 
courses and there were no URM-based differences in increased con¬ 
fidence in particular research and statistical skills, our project-based 
approach seems to have been a largely positive experience for these 
students, consistent with the literature showing increased motivation 
and more positive attitudes towards a discipline when taught via a 
project-based approach (Walker & Leary, 2009).While we are not yet 
able to evaluate the potential impact of the project-based course in 
terms of actual future academic decisions-making, the present find¬ 
ings suggest that a course in which students “have the opportunity to 
answer questions that [they] feel passionately about through indepen¬ 
dent research based on existing data”, may represent a promising op¬ 
tion for engaging diverse students in the process of statistical inquiry. 

Overall though, only 42% of students enrolling in the proj¬ 
ect-based course did so during their first two years of college. Given 
that participation by freshman and sophomores in introductory sta¬ 
tistics courses has the best chance of influencing students’ selection 
of both future courses as well as a major discipline, by extension, it 
also has the most hope for reversing the ‘leaky pipeline’ in which 
large numbers of students turn away from STEM courses (Blickenstaff, 
2005).More needs to be done to encourage enrollment of students 
in statistics courses as early as possible in their academic careers 
(Seymour, Hunter, Laursen, & Deantoni, 2004; ASA, 2014; Hurtado et 
al„ 2008). 

Though some differences were found between URM and non- 
URM students’ experiences with the course including URM students 
rating the course material as more difficult and being more likely to 
view the course as more challenging than others they had taken in 
college, those experiences as well as significant differences in back¬ 
ground characteristics between URM and non-URM students did not 
significantly predict course outcomes in multivariate models. Instead, 
ratings of the usefulness of flipped classroom sessions and supporting 
course resources and judging the course to be more useful than other 


college courses (measures on which the URM and non-URM students 
did not differ) were the most consistent predictors of positive course 
outcomes. One or more of these measures independently predicted 
higher exam scores and an increased interest in conducting research. 

Notably, while bivariate analyses showed that URM students 
were found to be more likely to feel that they accomplished more 
than expected in the course, it was lower ratings of accomplishing 
more than expected that was found to predict higher exam scores. 
This coupled with a positive association between both Math SAT 
scores and higher class status (i.e. junior and senior) in accounting for 
significant variability in exam scores suggests that test scores were 
potentially driven by experience and confidence in taking traditional 
tests. 

Taken together, we see that it is important to examine addition¬ 
al experience and outcome measures, beyond the typical content 
knowledge exams. This is especially relevant as the most beneficial 
effects of project-based learning are on measures that consider the 
application, as opposed to the memorization, of knowledge (Dochy, 
Segers.Van den Bossche, & Gijbels, 2003) and it is potentially these 
additional application outcomes and attitudes that are most import¬ 
ant to continued persistence in statistics. Given that no single course 
will adequately prepare students for either the amount or complexity 
of data they will encounter as professionals and as citizens (Resnick, 
1987; Collins & Halverson, 2010), modern courses need to focus on 
imparting a deep interest among students, belief in the importance 
of the discipline and a desire to continue learning statistics and disci¬ 
plines focused on data and computation. 

Though URM status was not found to be independently associ¬ 
ated with the vast majority of course outcomes, after controlling for 
pre course rating of interest in research, significant background char¬ 
acteristics and students’ experiences with the course, URM students 
were found to be more than two times more likely than non-URM 
students to report that their interest in conducting research had in¬ 
creased. Students’ increased interest in research is critical since early 
participation in research has been shown to be an important factor 
in retaining students in STEM (Hunter, Laursen, & Seymour, 2007; Pas- 
carella & Terenzini, 2005) given that it increases skills and a sense of 
identity as a researcher (Graham et al., 2013; Hurtado, Cabrera, Lin, 
Arellano, & Espinosa, 2009). The applied focus of this project-based 
course adds to a literature already showing that research experience 
in courses (e.g., Hatfull et al., 2006; Schaffer et al., 2014) can provide 
benefits similar to those achieved through independent research in 
a faculty lab (Linn, Palmer, Baranger, Gerard, & Stone, 2015; Lopatto, 
2007; Seymour et al., 2004). 

Though we believe that this inclusive and comparative approach 
allows students to widely communicate across disciplines on a variety 
of computational issues, we also recognize that there is more to be 
done to enhance access to truly interdisciplinary research and think¬ 
ing. Project-based learning, of course, presents its own challenges and 
customarily, courses offering opportunities for independent statistical 
inquiry have required a smaller group format (e.g. 20 or fewer stu¬ 
dents) compared to courses that rely on more traditional pedagogies. 
As noted in the methods, during development of our project-based 
course, the average class sections did not go above 26. More recent 
offerings of the project-based course have increased most sections 
sizes to 30 students (fall 2014) with great success, and with fall 2015, 
we have moved out of the computer lab into collaborative space that 
will allow students the mobility of laptops, the flexibility of differ- 
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ent table and seat configurations and space for sections of 50 to 60 
students. With the appropriate supporting infrastructure (e.g. helpful 
materials for outside of class that enhance engagement in active class 
sessions and a network of peer mentors), we hope to demonstrate 
that this project-based approach can be delivered by expending no 
more resources than many traditional lecture courses. 

Previous research has recognized the general challenge in teach¬ 
ing courses in statistics at both the undergraduate and graduate lev¬ 
el. There is no typical statistics student; instead, students come into 
statistics courses with differing backgrounds, experiences, learning 
styles and levels of preparation. This project-based course provokes 
students to encounter (and struggle with) the central concepts and 
principles not only within the discipline of statistics, but also with 
the discipline that their chosen research reflects.Although our model 
focuses on statistics education, the emphasis on authentic real-world 
activities with the goal of sparking interest and enthusiasm (BIE, 2012) 
can be achieved in curricular content as diverse as science (Kubiatko 
&Vaculova, 2011) and foreign language (Danan, 2010) instruction. In 
addition to the specific skills most directly emphasized, project-based 
courses provide students with experience in communication, organi¬ 
zation and time management (BIE, 20l2).We believe that our course 
can benefit other universities not only through dissemination of our 
model and experiences, but by making our newly developed resourc¬ 
es widely available.We are happy to share our course materials with 
others and encourage faculty to consider integrating project-based 
course content (http://passiondrivenstatistics.com). 
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